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ABSTRACT OF THE DISCLOSURE 

IR-L 5099-10 confers hypertransposing activity on 
5 transposable elements comprising , IR-L 5099-10 . IR-L 5099- 
Ifi, is a57 base pair sequence generated by a spontaneous 
mutation in T N5099-7 . Vectors comprising IR- L5099-10 
are useful in a variety of bacteria including but not 
limited to Streptomyces , Chainia, Saccharopolyspora , 
10 Steptoverticillium and Mycobacterium- 
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We claim: 

5 

1. The hypertransposing DNA sequence of Sequence 
ID No: 5 or a functional fragment thereof. 

2. The DNA sequence. of Claim 1 that is IR-L 5099- 

10 m. 

3 . A transposable element comprising the DNA 
sequence of Claim 1 . 

15 4. The transposable element of Claim 3 that is a 

transposon. 

5. The transposon of Claim 4 that is T n5099-10 . 

20 6. A vector comprising the transposon of Claim 5. 

7. A host cell containing a transposable element 
of Claim 3 . 

25 8. The host cell of Claim 7, said host cell being 

selected from the group consisting of Strepto/nyces, 
Streptoverticullium, Saccharpolyspora and Mycobacterium. 

i? 

9. A method of introducing a gene of interest into 
30 a host cell, said method comprising transforming said 
host cell with a vector of Claim 6. 
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10. The method of Claim 9 wherein said gene of 
interest is a gene encoding an enzyme in the 
biosynthetic pathway of a secondary metabolite. 

5 
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Tiule 

DNA SEQUENCES CONFERRING HYPERTRANSPOSABILITY AND 
q VECTORS CONSTRUCTED THEREWITH 



Transposable elements, such as transposoris, 
insertion sequences, and transposing bacteriophages, are 
found in both prokaryotic and eukaryotic cells. These 

10 elements are capable of insertion events at few or many 
different sites along the genome. This ability to 
transpose and insert, even in the absence of DNA 
homology, has allowed transposable elements to be 
exploited as powerful genetic tools. Some examples of 

15 the utility of such elements include gene mapping, use 
as promoter probes, construction of insertion mutations, 
the mediation of intergeneric recombination, the 
insertion of genetic information into the genome of a 
recipient strain, and the genetic tagging of specific 

20 genes - 

The development and exploitation of recombinant DNA 
technology in Streptomyces and related genera depends 
upon the availability of genetic tools for the 
introduction of foreign DNA or of multiple copies of 

25 native DNA into the genome of these cells. This 

development has been retarded by the very low number of 
transposable elements presently available for use in 
Streptomyces and related genera such as 
Streptoverticillitwi, Saccharopolyspora, Chainia and 

30 Mycobacteria, 

The transposable elements available for utilization 
in Streptomyces include: IS110 (Brut on, C. and K. 
Chater, 1987, Nucleic A cids Res .. 15:7053-7065); iSlfil 
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(Sladkova, A. 1986 Mol. Biol .. 20:878-881); T n4556 
(Chung, S. 1987 J. Bacteriol . . 169:443 6-4441); the 2.6 
kilobase minicircle from Streptomyces coelicolor 
5 (Lydiate, D., et al., 1986, Mol. Gen. Genet-. . . 203:79-88) 
and IS 493 and transposons derived therefrom. 

Insertion sequence IS493 was isolated from 
Streptomyces lividans using a transposon trap. 
Solenberg, P. and S. Burgett, 1989, J. Bacteriol .. 171: 

10 4807-4813. A variety of transposons have been developed 
utilizing ISA22. See Solenberg, P. and S. Burgett , 1989, 
J. Pagteriol. , 171:4807-4813; McHenney, M. and R. Baltz, 
1991, J. BacterioU, 173: 5578-5581; Hahn, D. et al, 
1991, J, Bacterj,Ql., 173:5573-5577; and Solenberg, P. 

15 and R. Baltz 1991 J, 3aqterjj,pJ, . , 173:1096-1104. The 
aforementioned transposons have proven to be useful in 
genetic manipulations of the various Act inomycet ales . 
Transposons derived from IS 493 exhibit transposition 
frequencies of about 10~ 6 . 

20 Transposon T n5099 . a xylE promoter probe 

transposon, which was described in Hahn et al., supra, 
was modified to comprise a promoter driving constitutive 
production of xvlE . Transposon T n5099-7 is the 
designation for the T n5099 derivative containing the 

25 cons ti tut ively expressed xvlE gene. The constitutive 
expression of xvlE resulted in a selective pressure 
against xyJjE production and ultimately led to a deletion 
in the region of the xvlE promoter and coding sequence. 
The deletion included a portion of the 5' inverted 

30 repeat of Tn5H2£J and generated a modified inverted 

repeat with changes in the 50-57 base area (reading left 
to right from the sense strand) . The altered 5 ' 
inverted repeat is designated IR-L 5099-10 and confers 
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hypertransposability on transposons and other elements 
comprising IR-L 5099-10 . 

IR-L 5099-10 , an inverted repeat generated by a 
5 spontaneous deletion in Tn 5099-7 , confers 

hypertransposing activity on transposable elements 
constructed therewith. 

Brief Description of the Figures 

10 

For the purposes of this disclosure, the Figures 
are not drawn exactly to scale. 

Figure 1 -- the restriction site and function map 
of plasmid pCZA141. 
15 Figure 2 -- the restriction site and function maps 

of T n5096 and T n5097 . 

Figure 3 — the restriction site and function map of 
T n5099-10 . 

20 For purposes of the present invention, as disclosed 

and claimed herein, the following terms are defined 
below. 

Am - the apramycin-resistant phenotype or gene 
conferring same. The Am R gene is also designated 
25 aac(3)lV. 

R 

Ap - the ampici 11 in-resistant phenotype or gene 
conferring same. 

Gene - a DNA segment which encodes a promoter 
function, a ribosome binding site and a translatable 
30 gene product. 

Hm R - the hygromycin-resistant phenotype or gene 
conferring same. 
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Hypertransposition - transposition by a 
transposable element at a frequency of 10~ 5 or greater. 

Insertion sequence - small (generally less than 
5 2000 bp) elements which encode no known genes unrelated 
to insertion function. 

Inverted Repeats - DNA sequences at the termini of 
insertion sequences and transposons which facilitate 
insertion of the insertion sequence or transposon into 
io host cell DNA. Inverted repeats are abbreviated IR and 
are sometimes designated IR-L and IR-R to denote which 
termini of the insertion sequence or transposon they 
define with IRL being the left (5') repeat and IR-R 
being the right (3') repeat. 
15 Ori - an origin of replication. 

Origin of replication - a DNA sequence which allows 
the replication of an extrachromosomal element in a 
specific host cell. 

Recombinant DNA cloning vector - any autonomously 
20 replicating agent, including, but not limited to, 

plasmids and phages, comprising a DNA molecule to which 
one or more additional DNA segments can be or have been 
added . 

Selectable marker - a gene or combination of genes 
25 of known function, the expression of which can be easily 
determined. 

Transformation - the introduction of DNA into a 
recipient host cell. 

Transposable element - a transposon, an insertion 
30 sequence, or a discrete piece of double stranded DNA 

which is capable of insertion at few or many sites in a 
genome . 
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Transposon - a transposable element, generally 
larger than 2 kb that may contain additional 
determinants unrelated to insertion function such as 
5 antibiotic or heavy metal resistance-conferring genes or 
biosynthetic genes. 

The present invention provides a novel sequence 
conferring hyper transposing properties and is designated 
IR-L 5099-10 . T n5099-10 comprises IR-L 5099-10 . Tn 5099-10 

io is illustrative of the transposons which can be 

constructed using the IR- L5099-10 hypertransposing 
sequence of the present invention. T n5099-10 and other 
vectors comprising the hypertransposing sequence of the 
present invention are useful for insertionally 

15 inactivating chromosomal genes as well as for inserting 
any sort of foreign DNA into the Streptomvces genome, 
gene-mapping, promoter probe construction, construction 
of insertion mutations, and the like. Another aspect of 
this invention is the use of this transposable element 

20 to create other novel transposons. The invention 

therefore helps to fulfill the long-felt need for such 
discrete, clearly defined elements for use in 
Streptomyces, thereby releasing the skilled artisan from 
reliance upon extra- chromosomal elements or homologous 

25 integration. Surprisingly, the IR-L 5099-10 element 

results in transposition frequencies approximately 10 3 
fold higher than other transposons derived from IS 493 . 
including T n509$ and TnSfi£2, and thus provides a means 
for facile achievement of higher transposition 

30 frequencies of genes of interest into host organisms. 

The steady state transposition frequency of T n5099- 
Ifl. was determined as described in Youngman, Philip, 
Plasmids:A Practical Approach (1987) K. Hardy, Ed., IRL 
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Press, Inc., Oxford, U.K. at pages 79-103 with 
modifications for growth in Streptomyces . Three 
replicate experiments using a temperature sensitive 
delivery vector were performed. The hypertransposing 
activity of Tn 5099-10 was demonstrated by comparison to 
Tn 5096 and T n5099 under equivalent experimental 
conditions. The results are summarized in Table 1. 



10 



15 



Transposon 
Tn 5096 
T n5099 
Tn5Q99-lQ 



Table 1 

Transposition Frequency 
4.3 x 1CT 6 
less than 10" 6 
2.5 x KT 3 



20 The experiments summarized above demonstrate that 

IR-L 5099-10 confers a transposition frequency 
approximately one thousand times greater than 
conventional IS 493 based vectors. Skilled artisans will 
appreciate that the thousand fold higher transposition 

25 frequency affords a facile means for increasing gene 

dosage {the number of copies of a given gene) in a cell 
and as such would be of immense benefit in improving 
production yields of metabolites such as antibiotics or 
immunosuppressives by increasing the gene dosage of the 

30 rate limiting enzyme in the biosynthetic pathway of the 
metabolite. 

T n5099-10 is similar to Tn 5099 . which has been 
described in the scientific literature. See Hahn, R. et 
al., 1991, J. Bact .. 173:5573-5577. Tn£1122 was 
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constructed by adding a xylE coding sequence, a 
hygromycin resistance gene and multiple cloning sites to 
IS4£1. Tn 5099 also contains a phage lambda pL promoter 
5 positioned 5' to the hygromycin resistance gene sequence 
to effect transcription of the hygromycin resistance 
gene product in E. coli, thereby enabling facile 
identification of E. coli transf ormants . The 
nucleotide sequence of is^ii is taught in Solenberg, P. 

10 and Burgett, S. J., 1991, Bact . 171:4807-4813. 

A restriction site and function map of T n5099-10 is 
provided in Figure 3 . The inverted repeats located at 
the termini of T n5099-10 are in opposite orientation as 
indicated by the arrows of Figure 3 . A lambda pL 

15 promoter is positioned 5' to the hygromycin resistance 
gene. The hygromycin resistance gene comprises its 
homologous promoter, which is operable in Actinomycetes . 
The phage lambda promoter was added for selection of E. 
coli transf ormants. The two open reading frames (OrfA 

20 and OrfB) are described in Solenberg and Burgett, supra, 
The translation product of OrfB is thought to be a 
transposase. The function of the OrfA gene product 
remains unelucidated. The OrfA functionality is not 
essential for transposition of IS 493 based transposons. 

25 The nucleotide sequence of T n5099-10 is provided 

below as Sequence ID No:l. 



30 
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GAGCGTTTTT CAACCTCAGG CTGACGGGGT CAGGGGTCAG AGGCGAGGTA 
ACCATCTGCG GTGATAAATT ATCTCTGGCG GTGTTGACAT AAATACCACT 
GGCGGTGATA CTGAGCACAT CAGATCGGTC GTCGAGGTCC ACAGGCCCTC 
5 GGCGACGTTC CGCCGGGCCT CGGCGACCGC CGCGTCGAGG CGCCGGTCGA 
AGGGGCAGTC CTCCACGGGC AGCTCGTGGA GGGCGCGGGC CAGCTCCGCC 
ATCGCCTCGA CCACGGCGAA CCGCTGGTGC TCGGGCCACT CCTCGGCCGC 
CGCGACGCCG GGGACGGCCT CCGTGACGAG CCACGCGGCG GTGTCGTCGG 
CACCGCGCTC GACGACGCGG GGGACGGGGA TCGGCGGGGC CTGGCGGCGC 

10 CTCGCCGTCG CAGAACCAGG CGGTGGCGTA CACCGTCGCC TCGGTCGGCC 
CGTAGAGATT GGCGATCCCG ACCGCAGCAC CACCGAGAAC GTCCCCGACG 
TGGCCGACCA GCCCGTCATC GTCAACGCCT GACCGCGGTG CGGACAGGCC 
GTGTCGCGAC CGGCCGTGCG GAATTAAGCC GGCCCGTACC CTGTGAATAG 
AGGTCCGCT GTG ACA CAA GAA TCC CTG TTA CTT CTC GAC CGT 

15 ATT GAT TCG GAT GAT TCC TAC GCG AGC CTG CGG AAC GAC CAG 
GAA TTC TGG GAG CCG CTG GCC CGC CGA GCC CTG GAG GAG CTC 
GGG CTG CCG GTG CCG CCG GTG CTG CGG GTG CCC GGC GAG AGC 
ACC AAC CCC GTA CTG GTC GGC GAG CCC GAC CCG GTG ATC AAG 
CTG TTC GGC GAG CAC TGG TGC GGT CCG GAG AGC CTC GCG TCG 

20 GAG TCG GAG GCG TAC GCG GTC CTG GCG GAC GCC CCG GTG CCG 
GTG CCC CGC CTC CTC GGC CGC GGC GAG CTG CGG CCC GGC ACC 
GGA GCC TGG CCG TGG CCC TAC CTG GTG ATG AGC CGG ATG ACC 
GGC ACC ACC TGG CGG TCC GCG ATG GAC GGC ACG ACC GAC CGG 
AAC GCG CTG CTC GCC CTG GCC CGC GAA CTC GGC CGG GTG CTC 

25 GGC CGG CTG CAC AGG GTG CCG CTG ACC GGG AAC ACC GTG CTC 
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ACC 


CCC 


CAT 


TCC 


GAG 


GTC 


TTC 


CCG 


GAA 


CTG 


CTG 


CGG 


GAA 


CGC 


CGC 


GCG 


GCG 


ACC 


GTC 


GAG 


GAC 


CAC 


CGC 


GGG 


TGG 


GGC 


TAC 


CTC 


TCG 


CCC 


CGG 


CTG 


CTG 


GAC 


CGC 


CTG 


GAG 


GAC 


TGG 


CTG 


CCG 


GAC 


GTG 


GAC 


ACG 


CTG 


CTG 


GCC 


GGC 


CGC 


GAA 


CCC 


CGG 


TTC 


GGC 


CAC 


GGC 


GAC 


CTG 


CAC 


GGG 


ACC 


AAC 


ATC 


TTC 


GTG 


GAC 


CTG 


GCC 


GCG 


ACC 


GAG 


GTC 


ACC 


GGG 


ATC 


GTC 


GAC 


TTC 


ACC 


GAC 


GTC 


TAT 


GCG 


GGA 


GAC 


TCC 


CGC 


TAC 


AGC 


CTG 


GTG 


CAA 


CTG 


CAT 


CTC 


AAC 


GCC 


TTC 


CGG 


GGC 


GAC 


CGC 


GAG 


ATC 


CTG 


GCC 


GCG 


CTG 


CTC 


GAC 


GGG 


GCG 


CAG 


TGG 


AAG 


CGG 


ACC 


GAG 


GAC 


TTC 


GCC 


CGC 


GAA 


CTG 


CTC 


GCC 


TTC 


ACC 


TTC 


CTG 


CAC 


GAC 


TTC 


GAG 


GTG 


TTC 


GAG 


GAG 


ACC 


CCG 


CTG 


GAT 


CTC 


TCC 


GGC 


TTC 


ACC 


GAT 


CCG 


GAG 


GAA 


CTG 


GCG 


CAG 


TTC 


CTC 


TGG 


GGG 


CCG 


CCG 


GAC 


ACC 


GCC 


CCC 


GGC 


GCC 


TGA 



CGC CCC GGG TGA AGA GCC CCGTCGGGCG GTGCCTGACG GGGCTTCTCA 
15 GTTAACCTAG GATCCATGGC GGCCGCAATA TTAATCAGGC TGACGGGGTC 
AGGGGTCAGA GGCGGGGTCT GGATCAGCGC GAGCGGTCGT GAGGTAGTAC 
CGGTTGCGCT CGAAGCTCAC TGCCGTCTAG GACAGAAGCT GACCCTAGCT 
CACGCCTACC GTCGCTGCTG GCAAGCAACA TCAGCGAAAG GACCGCG ATG 
CCC CAG CAG ACC AGC CCC GAG CCG GAG GGC TAC ACC ACC GTT 
20 GCG CCC TGG GTC GTC ACC GAC GAC ACG GGG GCT TTC CTC GAC 
TTC GTC ACC CAG GCA TTC GGG GGC GAG GAA CTC GGG CGG GTG 
TCG ACC GAG GAG GGC TTG ATC GGT CAC GCT GAG ATC CGA GTC 
GGC GAC ACC GTC GTG TTG GCC TTC GAC CGA CAC ACG GAC TGG 
CCC GCC ATG CCG AGC CTG CTG CGC GTA TTT GTC GCC GAT GCG 
25 GAC GAG GCG TTT TCC CAG GCC GTC GCA GCT GGC GGC TAT GTC 
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GTC ACT TCC CTG GCC GAC GAC GCC TTC GGA CAG CGC GGA GGG 
CGC ATC AAG GAC CCC TTC GGC AAT ATC TGG TGG GTG GTC AGC 
TGC GTT GAG GAC GTC TCT GAG GGC GAG ATG TGG AAG CGG CTG 
5 CAG GAC CCC GTA TAC GCC GAG GCC ATG CGC GTG GCT CAG GAG 
ACA CTC GAC GCC GAG CTC AGC GGT CGA CGC CAC GGG CGC AGC 
AGC GCA CCC GTC AGG ACA ACC AGC TAG ACCCACGACG 
TCGGGGTTCG CATGCGCCAG ACCGGTAGCC AAGGAGCGTC TTCATTCTGC 
TTCGGCAGCT TGCGGGTTGG CCTGACACAG GGGTGAAGCC CCTGGTAGAT 

10 GGGTTTTCGA CCAAGAAATC CGTCTCTACT AGAGGCTTCAC GTG CTT GTT 
TAC CCC TCC GGG CTG GAC GTG TCC AGT TCT GCC CTC CGC TTC 
. CTC TCC GCC CGT CTG CGG GAA CAC CGC CGT GGT CTC GGC ACC 
CGG TGG CGG CGC CTG AGC GCA GGT CGG CAG GCT CTG CTC GCC 
CTC GCC CAT CTC CGC AAC GGT CAT CCG TAT GTG CAG CTC GCG 

15 GCC GGG TTC GGC GTG GGC ACC ACC ACT GCC TAC CGC TAC GTC 
ACC GAG GCA GCT GAG GTT CTG GCC GCC CTT GCC CCG ACG CTC 
GCC GAG GCG GTC CGG GCG GCG TCG ATG AAG GCG TTC GTC CTG 
CTG GAC GGG ACG CTG CTG CCG ATC GAC CGA ATC GCC GCC GAC 
CGG CCT TTC TAC TCG GGC AAG CAC AAG AAA CAC GGG ATG AAC 

20 GTACAG GTG ATC GCA GAT CCT TCC GGC CGC CTG CTG TGG GCC 
TCG CCG GCC CTG CCC GGA GCC GTC CAC GAC GTC CGT GCG GCC 
CGC GAG CAC GGC ATC ATC GAC ACG CTT GCC ACA GCC GAC GTC 
AAC TGC TGG GCA GAC AAG GGC TAC CAG GGC GCC GGA GGC ACG 
GTC CGT GTT CCC TAC CGC GGC CGG TGG GAG ACG CTC TCT GCA 

25 GGG CAG CAG GCC GTC AAT CGA TCC CAC GCG AAG ATC CGG GCG 
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CTC GTC GAG CAA GCC GTC GCG ACC CTC AAG TCC TGG CGA CTC 

CTT CGC AGG CTC CGT TGC TCG ACC ACC CGC GTC ACG GTC CTC 

GTC CAG GCC GTC CTC TGC CTG CAT CAG GCC AGC TCA GCT TGA 
5 GGATGGAAAACGCTC 

The nucleotide sequence of Sequence ID No:l should 
be read in context with the restriction site and function 
map of T n5099-10 , which is provided in Figure 3. 

10 The hypertransposing activity of T n5099-10 has been 

localized to the first 57 nucleotides of the 5* 
insertion sequence. These first 57 nucleotides of 
Sequence ID No:l are designated IR-L 5099-10 . The region 
of Sequence ID No:l which confers hypertransposing 

15 activity f IR-L 5099-10 ) includes the sequence identified 
as IS 493-1Q splice site in Figure 3 , which was generated 
when the xylE gene was spontaneously deleted from 
T n5099-7 to generate T n5099-10 . The sequence of IR- 
L 5099-10 is set forth below as Sequence ID No: 5. 

20 

GAGCGTTTTT CAACCTCAGG CTGACGGGGT CAGGGGTCAG AGGCGAGGTA 
ACCATCT 

The combined teachings of Sequence ID No:l, 
25 Sequence ID No: 5 and Figure 3 enable the construction of 
unlimited varieties of vectors which utilize the 
hypertransposing function of IR-L 5099-10 {Sequence ID 
No: 5). Figure, 3 provides restriction endonuclease sites 
which allow the facile construction of IR- L5099-10 
30 vectors from IS 493 vectors by merely substituting the 

IR-L 5Q99-10 sequence for the inverted repeat at the left 
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end (as viewed in Figure 3) of IS 493 . Sequence ID No:l 
provides the nucleotide sequence of Tn 5099-10 , which 
comprises from left to right as viewed in Figure 3 : the 
5 Inverted Repeat ( IR-L 5099-10 ) which is Sequence ID No: 5; 
phage lambda's leftward promoter; a hygromycin 
resistance gene; a multiple cloning site; OrfA; OrfB and 
the pristine IR-R of IS122- The restriction site and 
function map of Figure 3 provides the oligonucleotide 

10 numbers (reading from left to right) which correspond to 
the restriction endonuclease sites and thereby renders 
utilization of the components of Sequence ID No:l to 
generate vectors comprising Sequence ID No: 5 a routine 
laboratory exercise. Example IB teaches the synthesis 

15 of a linker which is readily inserted into the unique 
Sau l site of plasmid pCZAl41 (NRRL B18417) to generate 
PCZA141-10. In the event skilled artisans seek 
additional information on 1S422. insertion sequences and 
transposons derived therefrom reference is directed to 

20 the teachings of Hahn et al., supra; Solenberg and 

Baltz, supra; McHenney and Baltz, supra; and Solenberg 
and Burgett, supra, the entire teachings of which are 
herein incorporated by reference. 

Example 1 teaches construction of transposons 

25 derived from the IS 493 sequence, vectors which 

transpose at higher frequency are readily prepared 
merely by substituting the IR-L 5Q99-1Q sequence of 
Sequence ID 5 for the corresponding region in vectors 
comprising IS 493 . References to literature describing 

30 T n5096 and T n5099 have been provided in an earlier ! 
section. Skilled artisans will appreciate the facile 
nature of converting such related transposons into IR- 
L 5099-10 vectors. 
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The IS 4 93. transposable element can be obtained from 
plasmid pZCAl41, which can be conventionally isolated 
from s. coli K12 JM109/pCZAl41 , a strain deposited and 
5 made part of the permanent stock culture collection of 
the NRRL. A culture of E. coli K12 JM109/pCZAl41 can be 
obtained from the NRRL under the accession number NRRL 
B18417. A restriction site and function map of plasmid 
PCZA141 is presented in Figure 1 of the accompanying 

10 drawings . 

The IS 493 element comprises two distinct open 
reading frames. The first open reading frame, 
designated OrfA, encodes 177 amino acids and is found 
between the unique Saul site and the unique £phl site. 

15 The second open reading frame, designated OrfB, encodes 
233 amino acids and is located at the opposite end of 
the IS12i element from OrfA. The apramycin 
resistance-conferring gene (aac(3)!V) ligated into this 
site can be oriented in either direction. T n5096 , found 

20 in plasmid pCZAl56, contains the aac(3) IV gene in an 

orientation opposite of OrfA and OrfB. Tn 5097 . found in 
plasmid pCZAl57, contains the aac (3) iv gene in the same 
orientation as OrfA and OrfB. Both T n5096 and Tn 5097 
are useful in integrating this foreign gene into 

25 Actinomycetales genomes. Many other novel transposons 
can be created by ligating any foreign gene of interest 
into ISA21 or IR-L 5099-10 based vectors. Restriction 
site and function maps of T n5096 and Tn 5097 are found in 
Figure 2 of the accompanying drawings. 

30 Transposition events, such as those made possible 

by 1S4S1 or the corresponding IR- L5099-10 vectors, are 
very useful because transposition may occur in a variety 
of locations in the genome and does not require DNA 
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homology for integration. In addition, integration of 
DNA by transposition mechanisms is not limited to 
organisms that have functional homologous recombination 
5 systems. 

The IR-L 5099-10 hypertransposing sequence is in no 
way limited to the use in the plasmids herein described, 
as many different techniques and cloning vectors could 
be used to introduce the transposable element into the 

10 host cell. Furthermore, the types of genes integrated 
into the chromosome via transposition using the IR- 
L 5099-10 element are also virtually limitless. The 
antibiotic resistance-conferring genes of the 
exemplified plasmids could be replaced by many different 

15 antibiotic resistance conferring genes or the antibiotic 
resistance-conferring genes could be replaced by heavy 
metal resistance-conferring genes, secondary metabolite 
biosynthetic genes, combinations thereof. The stable 
integration of secondary metabolite biosynthetic genes 

20 into the chromosome of a host cell can cause increased 
production of the desired gene product or the production 
of new, hybrid gene products. Those skilled in the art 
recognize that virtually any gene encoding any function 
can be transposed into the host chromosome using the 

25 IR-L 5099-10 element disclosed herein. Therefore, the 
addition of any extraneous DNA within the parameters of 
the IR-L 5099-10 element would be within the scope of the 
present invention. 

Many modifications and variations of the present 

30 illustrative DNA sequence are possible. Such 

modifications can introduce new restriction sites into 
the IR-L 5099-10 vectors and could greatly increase the 
utility of the element. Indeed, some modifications 
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could potentially alter the frequency of transposition 
controlled by IS 493 derivatives. These modified 
IR-L5099-10 elements may occur naturally such as 
5 IR-L 5099-10 which generated Tn 5099-10 or can be produced 
by synthetic methods in substantial accordance with the 
procedures of Itakura et al . , 1977, Science . 198:1056 
and Crea et al. , 1978, Proc, Nat. Acad. Sci . , USA 
75:5765. In addition, synthetic components or linkers 

10 can be synthesized either by using a Systec 1450A DNA 
synthesizer {Systec Inc., 3816 Chandler Drive, 
Minneapolis, MN) or an ABS 380A DNA synthesizer {Applied 
Biosystems, Inc., 850 Lincoln Center Drive, Foster City, 
CA 94404) . Many other DNA synthesizing instruments are 

15 known in the art and can be used to make synthetic DNA 
fragments. Therefore, the present invention is in no 
way limited to the DNA sequences and plasmids 
specifically exemplified. 

T n5099-10 resulted from the spontaneous deletion of 

20 the xylE coding region and its constitutive promoter 
from a transposon designated Tn 5099-7 . The deletion 
eliminated a sequence in the 50-57 base area. See 
Figure 3 and Sequence ID No: 5. Those of ordinary skill 
in the art realize that further deletions in this area 

25 would lead to IRs which would likewise confer 

hyper transposing activity to insertion sequences and 
transposons. The availability of aforementioned and 
numerous other DNA synthesizers as well as the growing 
numbers of commercial laboratories which perform custom 

30 DNA synthesis and exonucleases which "chew-back" the 
ends of DNA sequences allows the facile preparation of 
IR-L 5099-10 -like sequences which are smaller than the 
IR-L 5099-10 sequence of Sequence ID No: 5. Accordingly, 
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the present invention embraces functional subf ragments 
of Sequence ID No: 5. 

Plasraid pCZA141-10 is illustrative of the vectors 
5 which can be constructed utilizing the IR-L 5099-10 

sequence of the present invention. Plasmid pCZA141-10 
contains T n5099-10 which has IR-L 5099-10 as the IR-L and 
the IR-R of IS 493 . Skilled artisans realize that the 
IR-R of T n5099-10 can be readily replaced with IR-L 5099- 

10 lfi. to generate further hypertransposing transposons. 
Skilled artisans also realize that the non-Tn 5099-10 
components of plasmid pCZAl41-10 are readily 
interchangeable with other vector components which 
provide the same functions. The non-T n5099-10 

15 components of plasmid pCZAl41-10 are derived from pUC19, 
a commercially available vector with numerous functional 
equivalents also being available from depositories such 
as the ATCC and the NRRL as well as numerous commercial 
concerns such as New England Biolabs, Pharmacia, 

20 Boehringer Mannheim, etc.. Accordingly, the vector 
backbone (non-T n5099-10 ) is not a limitation to the 
vectors which can be used to introduce the IR-L 5099-10 
transposable elements of the present invention into host 
cells. 

25 The transposable elements constructed with the IRL- 

5099-10 sequence are useful in numerous species and 
strains of Streptomyces , Chainia, Saccharopolyspora, 
Steptoverticillium and Mycobacterium, The 
transformation and transfection protocols required for 

30 introduction of vectors into the aforementioned bacteria 
are well known in the art and will not be reproduced as 
part of this disclosure. 
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The examples which follow are intended to further 
illustrate the present invention and are not to be 
interpreted as limiting on the scope thereof, while the 
5 examples and detailed description sections of the present 
invention are sufficient to guide anyone of ordinary skill 
in the art in the practice of the present invention, 
skilled artisans are also directed to Molecular Cloning A 
Laborato ry Manual. Second Edition, Sambrook, J., Fritsch, 

10 E. F . , and Maniatis, T., Cold Spring Harbor Press, 1989, 
and Current Prot ocols In Molecular Biology . Ausubel, F.M., 
Brent, R. , Kingston, R.E., Moore, D. D. , Seidman, J.G., 
Smith, J. A., and Struhl, K. , Ed. Greene Publishing 
Associates and Wiley- Interscience, 1989. The 

15 aforementioned resources provide an excellent technical 
supplement to any discourse in genetic engineering. 
Streptomyces and related genera such. as Saccharopolyspora , 
Streptoverticilliuiii, and Chainia require adaptation of the 
techniques described in Sambrook, supra and Molecular 

20 Cloning, which are directed primarily towards E. coli and 
related gram negative bacilli. Genetic Manipulation of 
Strer>tomvces-A Laboratory Manual. Hopwood, D. A., et al., 
Ed. F. Crowe and Sons, Ltd., Norwich, U.K. provides an 
excellent supplement as it provides a "methods book" 

25 approach to the molecular biology protocols used in genetic 
manipulations of Streptomyces. 



30 
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Example 1 
CONSTRUCTION QR PLASMID DCZA141-10 

5 

A. Isolation of Plasmid pCZA141 

Plasmid pCZA141, which contains IS 493 on an 
approximately 2.1 kb Hin dlll restriction fragment is 
obtained from the Northern Regional Research Center, 

10 NRRL , from B. coli K12 JMl09/pCZAl41 under the accession 
number NRRL B-18417. A restriction site and function 
map of plasmid pCZA141 is presented in Figure 1 of the 
accompanying drawings. 

The lyophils of K12 JM109/pCZAl41 are plated onto 

15 L-agar plates (10 g of Bacto-tryptone, 10 g of NaCl, 5 g 
of Bacto-Yeast Extract, and 15 g of agar per liter) 
containing 25 |Ag/ml ampicillin to obtain a single colony 
isolate of the strain. This colony is used to inoculate 
about 500 ml of L broth (1% (w/v) Bacto tryptone; 0.5% 

20 (w/v) yeast extract; 0.5% NaCl and 0.1% (w/v) dextrose 
at pH 7) containing 25 |lg/ml ampicillin, and the 
resulting culture is incubated at 37°C with aeration 
until the cells reach stationary phase. 

Plasmid DNA is obtained from the cells in 

25 accordance with the following procedure, which is 

adapted from Maniatis et al., 1982, Molecular Cloning 
(Cold Spring Harbor Laboratory) . This same procedure is 
used, but on a smaller scale and with the ultracentri- 
fugation steps replaced with phenol followed by 

30 chloroform extractions, to prepare the plasmid DNA used 
to identify the E. coli trans formants . 

About 500 ml of stationary-phase E. coli Kl2 
JMl09/pCZAl41 cells are harvested by centrifugation at 
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4000Xg for 10 minutes at 4°C, and the supernatant is 
discarded. The cell pellet is washed in 100 ml of 
ice-cold STE buffer (0.1 M NaCl; 10 mM Tris-HCl, pH 7.8; 
5 and 1 mM EDTA) . After the cell pellet is washed, the 
pellet is resuspended in 10 ml of Solution 1 (50 mM 
glucose; 25 mM Tris-HCl, pH=8.0; and 10 mM EDTA) that 
contains 5 mg/ml lysozyme and is left at room 
temperature for 10 minutes. Twenty ml of Solution 2 
10 (0.2 N NaOH and 1% SDS) are then added to the 

lysozyme- treated cells, and the solution is gently mixed 
by inversion. The mixture is incubated on ice for 10 
minutes. 

Fifteen ml of ice-cold, 3 M sodium acetate, pH=4.8, 

is are added to the lysed-cell mixture, and the solution is 
mixed by inversion. The solution is incubated on ice 
for 60 minutes. The 3 M sodium acetate solution is 
prepared by mixing equal volumes of 3 M acetic acid and 
3 M sodium acetate. 

20 The lysed cell mixture is centrifuged in a Beckman 

SW27 rotor (or its equivalent) at 20,000 rpm for 20 
minutes at 4°c. About 36 ml of supernatant are 
recovered, and 2.5 volumes of ethanol are added, mixed, 
and the resulting solution left on ice for 15 minutes. 

25 The plasmid DNA is collected by centrifugation at 
12,000Xg for 30 minutes at room temperature. The 
supernatant is discarded, and the DNA pellet is washed 
with 70% ethanol at room temperature. The ethanol wash 
is decanted, and the pellet is dried in a vacuum 

30 desiccator. The pellet is then resuspended in 8 ml of * 
TE buffer. 

Eight grams of CsCl are added to the DNA solution. 
About 0.8 ml of a 10 mg/ml solution of ethidium bromide 
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in water are added for each 10 ml of CsCl -DNA solution. 
The final density of the solution is about 1.55 g/ml, 
and the ethidium bromide concentraton is about 
5 800 ng/ml. The solution is transferred to a Beckman 
Type 50 centrifuge tube, filled to the top with TE 
buffer containing 1.55 g/ml CsCl, sealed, and cen- 
trifuged at 45,000 rpm for 24 hours at 20°C. After 
centrifugation, two bands of DNA are visible in ordinary 

10 light and become even more prominent in UV light. The 
cap is removed from the tube, and the lower DNA band is 
recovered using a syringe with a #21 hypodermic needle 
inserted through the side of the centrifuge tube. 

The ethidium bromide is removed from the solution 

15 of plasmid DNA by several extractions with 

water-saturated 1-butanol, and the CsCl is removed by 
dialysis against TE buffer. After extractions with 
buffered phenol and then chloroform, the DNA is pre- 
cipitated, washed with 70% ethanol, and dried. About 

20 0.5 mg of plasmid pCZA141 DNA can be obtained by this 
procedure. 

B. Digestion of Plasmid pCZAl41 with Sau l 

About 5 ^g of plasmid pCZA141 are digested with 

25 Saul. The composition of digestion buffers and reaction 
conditions for restriction endonuclease cleavage are 
provided in New England Biolabs Catalogs and product 
literature as well as the Sambrook et al. t supra, and 
MQ^eculax Cloning* supra, references and will not be 

30 recited here. Plasmid pCZAl 41 contains a single ! 
site, which is near the 5* end of IS 493 and within the 
IR-L of IS422. Digestion of plasmid pCZA141 linearizes 
the vector to accommodate the DNA linker which convert 
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IS421 to T n5099-10 . Following digestion with Sau l the 
linear pC2Al41 was conventionally purified. 

5 C. Preparation of the Linker 

Ligation of the DNA sequence provided below into 
the unique Sau l site of plasmid pCZAl41 ISA22 generates 
Tn 5Q99-l Q. 



10 


TCAGGCTGAC 


GGGGTCAGGG 


GTCAGAGGCG 


AGGTAACCAT 


CTGCGGTGAT 




AAATTATCTC 


TGGCGGTGTT 


GACATAAATA 


CCACTGGCGG 


TGATACTGAG 




CACATCAGAT 


CGGTCGTCGA 


GGTCCACAGG 


CCCTCGGCGA 


CGTTCCGCCG 




GGCCTCGGCG 


ACCGCCGCGT 


CGAGGCGCCG 


GTCGAAGGGG 


CAGTCCTCCA 




CGGGCAGCTC 


GTGGAGGGCG 


CGGGCCAGCT 


CCGCCATCGC 


CTCGACCACG 


15 


GCGAACCGCT 


GGTGCTCGGG 


CCACTCCTCG 


GCCGCCGCGA 


CGCCGGGGAC 




GGCCTCCGTG 


ACGAGCCACG 


CGGCGGTGTC 


GTCGGCACCG 


CGCTCGACGA 




CGCGGGGGAC 


GGGGATCGGC 


GGGGCCTGGC 


GGCGCCTCGC 


CGTCGCAGAA 




CCAGGCGGTG 


GCGTACACCG 


TCGCCTCGGT 


CGGCCCGTAG 


AGATTGGCGA 




TCCCGACCGC 


AGCACCACCG 


AGAACGTCCC 


CGACGTGGCC 


GACCAGCCCG 


20 


TCATCGTCAA 


CGCCTGACCG 


CGGTGCGGAC 


AGGCCGTGTC 


GCGACCGGCC 




GTGCGGAATT 


AAGCCGGCCC 


GTACCCTGTG 


AATAGAGGTC 


CGCT GTG 



ACA CAA GAA TCC CTG TTA CTT CTC GAC CGT ATT GAT TCG GAT 

GAT TCC TAC GCG AGC CTG CGG AAC GAC CAG GAA TTC TGG GAG 

CCG CTG GCC CGC CGA GCC CTG GAG GAG CTC GGG CTG CCG GTG 

25 CCG CCG GTG CTG CGG GTG CCC GGC GAG AGC ACC AAC CCC GTA 

CTG GTC GGC GAG CCC GAC CCG GTG ATC AAG CTG TTC GGC GAG 

CAC TGG TGC GGT CCG GAG AGC CTC GCG TCG GAG TCG GAG GCG 
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TAC 


GCG 


GTC 


CTG 


GCG 


GAC 


GCC 


CCG 


GTG 


CCG 


GTG 


CCC 


CGC 


CTC 


CTC 


GGC 


CGC 


GGC 


GAG 


CTG 


CGG 


CCC 


GGC 


ACC 


GGA 


GCC 


TGG 


CCG 


TGG 


CCC 


TAC 


CTG 


GTG 


ATG 


AGC 


CGG 


ATG 


ACC 


GGC 


ACC 


ACC 


TGG 


CGG 


TCC 


GCG 


ATG 


GAC 


GGC 


ACG 


ACC 


GAC 


CGG 


AAC 


GCG 


CTG 


CTC 


GCC 


CTG 


GCC 


CGC 


GAA 


CTC 


GGC 


CGG 


GTG 


CTC 


GGC 


CGG 


CTG 


CAC 


AGG 


GTG 


CCG 


CTG 


ACC 


GGG 


AAC 


ACC 


GTG 


CTC 


ACC 


CCC 


CAT 


TCC 


GAG 


GTC 


TTC 


CCG 


GAA 


CTG 


CTG 


CGG 


GAA 


CGC 


CGC 


GCG 


GCG 


ACC 


GTC 


GAG 


GAC 


CAC 


CGC 


GGG 


TGG 


GGC 


TAC 


CTC 


TCG 


CCC 


CGG 


CTG 


CTG 


GAC 


CGC 


CTG 


GAG 


GAC 


TGG 


CTG 


CCG 


GAC 


GTG 


GAC 


ACG 


CTG 


CTG 


GCC 


GGC 


CGC 


GAA 


CCC 


CGG 


TTC 


GGC 


CAC 


GGC 


GAC 


CTG 


CAC 


GGG 


ACC 


AAC 


ATC 


TTC 


GTG 


GAC 


CTG 


GCC 


GCG 


ACC 


GAG 


GTC 


ACC 


GGG 


ATC 


GTC 


GAC 


TTC 


ACC 


GAC 


GTC 


TAT 


GCG 


GGA 


GAC 


TCC 


CGC 


tap 








CAA 


\- 1 Kj 


PAT" 
LAI 




AAC 




1 iC 


CGG 


GGC 


GAC * 


CGC 


GAG 


ATC 


CTG 


GCC 


GCG 


CTG 


CTC 


GAC 


GGG 


GCG 


CAG 


TGG 


AAG 


CGG 


ACC 


GAG 


GAC 


TTC 


GCC 


CGC 


GAA 


CTG 


CTC 


GCC 


TTC 


ACC 


TTC 


CTG 


CAC 


GAC 


TTC 


GAG 


GTG 


TTC 


GAG 


GAG 


ACC 


CCG 


CTG 


GAT 


CTC 


TCC 


GGC 


TTC 


ACC 


GAT 


CCG 


GAG 


GAA 


CTG 


GCG 


CAG 


TTC 


CTC 


TGG 


GGG 


CCG 


CCG 


GAC 


ACC 


GCC 


CCC 


GGC 


GCC 






TGACGCCCCG 



20 GGTGAAGAGC CCCGTCGGGC GGTGCCTGAC GGGGCTTCTC AGTTAACCTA 

GGATCCATGG CGGCCGCAAT ATTAA (Sequence ID No: 6) 

25 
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Oligonucleotide synthesis is well known in the art and 
the sequence of IR-L 5099-10 (Sequence ID No: 6) is 
readily synthesized by such methods. See Sambrook and 
5 frlaniatis, supra, and Current Protocols in Molecular 
Biology, supra. The linker sequence corresponds to 
Sequence ID No: 6. The linker is synthesized as a series 
of oligonucleotides which collectively encompass both 
strands of Sequence ID No: 6. The selection of 

10 appropriate oligonucleotides is a matter of choice. The 
present inventors did not synthesize the fragment since 
Tn5099-10 resulted from a spontaneous deletion in 
T n5099-7 as described in an earlier section of the 
specification. The linker provided herein is provided 

15 to supplement the combined teachings of Sequence ID No:l 
(IS42!) and Sequence ID No:5 (IR- L5099-10 ) . which are 
deemed enabling in and of themselves. 

D. Ligation of the 1675 bp linker into pCZA141 to 
20 generate plasmid pCZAl41-10 

The linearized pCZAl41 of Example 1A, which has 
S?ul overhangs and the 1675 bp linker of Example IB, 
which also has Sau l overhangs are ligated in 
accordance with procedures well known in the art. See 
25 Sambrook et al . , supra, at 1.53-1.72. 

E. Transformation of JS. coli MM294 with the ligation 
mixture of Example 1C. 

To prepare E. coli K12 MM294 cells that are 
30 competent for transformation, the lyophils of E. coli 
K12 MM294 obtained from the NRRL {accession number NRRL 
B-15625) are reconstituted to isolate single colonies. 
One single-colony isolate of MM294 is inoculated into 
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5 ml of broth and the culture is incubated at 37°C 
overnight with aeration. Fifty |il of the overnight 
culture are used to inoculate 5 ml of broth. The 
5 culture is incubated at 37°C overnight with aeration. 
The following morning, the culture is diluted to 200 ml 
with broth. The diluted culture is incubated at 37°C 
with aeration until the absorbance at 550 nm (A) is 
about 0.5. The culture is cooled for ten minutes in an 

10 ice-water bath, and the cells are then collected by 
centrifugation at 4000Xg for 10 minutes at 4°C. The 
cell pellet is resuspended in 100 ml of cold 10 mM NaCl 
and then immediately repelleted by centrifugation. The 
cell pellet is resuspended in 100 ml of 30 mM CaCl and 

15 incubated on ice for 20 minutes. 

The cells are again collected by centrifugation and 
resuspended in 10 ml of 30 mM CaCl. A one-half ml 
aliquot of the cells is added to the ligated DMA 
prepared above. The cell-DNA mixture is incubated on 

20 ice for one hour, heat-shocked at 42°C for 90 seconds, 
and then chilled on ice for about two minutes. The 
cell-DNA mixture is centrifuged, and the cell pellet is 
resuspended into 0.5 ml of broth in a 1.5 ml tube and 
incubated at 37°C for one hour. 

25 Aliquots of the transformation mixture are plated 

on L-agar plates containing 100 ng apramycin/ml. The 
plates are incubated at 37°C overnight. Several 
apramycin-resistant colonies are selected and then 
screened by restriction enzyme analysis. A resriction 

30 site and function map of plasmid pCZAl41 is provided iri 
Figure 1. The insertion of the 1675 bp linker of 
Example IB into the unique Saul site results in two 
different plasmids which differ only in the orientation 
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of the linker. The plasmid having the linker in proper 
orientation contains T n5099-10 . Digestion of the 
desired plasmid with Eco RI and Sau l results in DNA 
5 fragments of 3.6 kb and 2.9kb when resolved on an 

agarose gel. The plasmid having the linker in proper 
orientation and thus comprising T n5099-10 (Sequence ID 
No:l) is designated pCZAl41-10. Restriction endonuclease 
mapping is discussed in Sambrook et al., supra, at 1.85 
10 et. seq. 
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